Reasoning and identifying relevant matches for XML keyword search

Authors

Ziyang Liu

Yi Chen

Abstract

Keyword search is a user-friendly mechanism for retrieving XML data in web and scientific applications. An intuitively compelling but vaguely defined goal is to identify matches to query keywords that are relevant to the user. However, it is hard to directly evaluate the relevance of query results due to the inherent ambiguity of search semantics. In this work, we investigate an axiomatic framework that includes two intuitive and non-trivial properties that an XML keyword search technique should ideally satisfy: monotonicity and consistency, with respect to data and query. This is the first work that reasons about keyword search strategies from a formal perspective. Then we propose a novel semantics for identifying relevant matches, which, to the best of our knowledge, is the only existing algorithm that satisfies both properties. An efficient algorithm is designed for realizing this semantics. Extensive experimental studies have verified the intuition of the properties and shown the effectiveness of the proposed algorithm.

Similar resources

Reasoning and Identifying Relevant Matches

Challenges, Techniques and Directions in Building XSeek: an XML Search Engine

The importance of supporting keyword searches on XML data has been widely recognized. Different from structured queries, keyword searches are inherently ambiguous due to the inability/unwillingness of users to specify pinpoint semantics. As a result, processing keyword searches involves many unique challenges. In this paper we discuss the motivation, desiderata and challenges in supporting keyw...

Faster Algorithms for Searching Relevant Matches in XML Databases

Keyword search is a friendly mechanism for the end user to identify interesting nodes in XML databases, and the SLCA (smallest lowest common ancestor)-based keyword search is a popular concept for locating the desirable subtrees corresponding to the given query keywords. However, it does not evaluate the importance of each node under those subtrees. Liu and Chen proposed a new concept contribut...

Retrieving Reusable Software Components Using Enhanced Representation of Domain Knowledge

This paper describes an ontology-based approach for identifying and retrieving relevant software components in large reuse libraries. Since it is usually difficult to precisely identify exact matches without considering domain knowledge, we exploit the use of domainspecific ontologies to enrich a knowledge base initially populated with multi-faceted ontological descriptions of API components. I...

Ranking Friendly Result Composition for XML Keyword Search

This paper addresses an open problem of keyword search in XML trees: given relevant matches to keywords, how to compose query results properly so that they can be effectively ranked and easily understood by users. The approaches adopted in the literature are oblivious to user search intention, making ranking schemes ineffective on such results. Intuitively, each query has a search target and ea...

Journal:

PVLDB

Volume 1

Publication date: 2008
